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Abstract. Let f(n,£) be the maximum integer such that every set of n points in the 
plane with at most I collinear contains a subset of f(n,£) points with no three collinear. 
First we prove that if £ ^ 0(^/n) then f(n,£) ^ ^(\/t^j)- Second we prove that if 
I ^ 0(n ( - 1_<! ) / ' 2 ) then f(n,£) ^ Q.{^Jn \og l n), which implies all previously known lower 
bounds on f(n,£) and improves them when I is not fixed. A more general problem is to 
consider subsets with at most k collinear points in a point set with at most £ collinear. 
We also prove analogous results in this setting. 



1. Introduction 

A set of points in the plane is in general position if it contains no three collinear points. 
The general position subset selection problem asks, given a finite set of points in the 
plane with at most £ collinear, how big is the largest subset in general position? That is, 
determine the maximum integer f(n,£) such that everg set of n points in the plane with 
at most £ collinear contains a subset of f(n,£) points in general position. Throughout 
this paper we assume t ^ 3. 

The problem was originally posed by Erdos, first for the case I = 3 [8], and later in a 
more general form [9]. Fiiredi [10] showed that the density version of the Hales-Jewett 
theorem [11] implies that f(n,£) ^ o(n), and that a result of Phelps and Rodl [20] on 
independent sets in partial Steiner triple systems implies that 

/(n,3) ^ n(Vnlnn). 



Until recently, the best known lower bound for £ ^ 4 was f(n,£) ^ y / 2n/(£ — 2), proved 
by a greedy selection algorithm. Lefmann [16] showed that for fixed £, 

f(n,£) ^ Q £ (Vnlnn). 

(In fact, his results are more general, see Section 3.) 

In relation to the general position subset selection problem (and its relatives), Bral?, 
Moser and Pach [2, page 318] write, "To make ang further progress, one needs to explore 
the geometric structure of the problem." We do this by using the Szemeredi-Trotter 
theorem [25]. 

We give improved lower bounds on f{n,£) when £ is not fixed, with the improvement 
being most significant for values of £ around ^fn. Our first result (Theorem 3) says that 
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if £ ^ 0{^/n) then f{n,£) ^ ^(1/0)- 0ur second result (Theorem 5) says that if 
£ sC 0(n^ l ~ e ^ 2 ) then /(n,f) ^ n(y/n log £ n). Note that for fixed £ this implies the 
previously known lower bound on f(n,£). 

In Section 3 we consider a natural generalisation of the general position subset selection 
problem. Given k < £, Erdos [9] asked for the maximum integer f(n,£,k) such that every 
set of n points in the plane with at most £ collinear contains a subset of f(n, £, k) points 
with at most k collinear. Thus f(n, £) = f(n, £, 2). We prove results similar to Theorems 3 
and 5 in this setting too. 



2. Results 



Our main tool is the following lemma. 

Lemma 1 . Let P be a set of n points in the plane with at most £ collinear. Then the 
number of collinear triples in P is at most c(n 2 ln£ + £ 2 n) for some constant c. 



Proof. For 2 ^ i ^ £, let Sj be the number of lines containing exactly i points in P. A 
well-known corollary of the Szemeredi-Trotter theorem [25] states that for some constant 
O 1, for all i > 2, 



Thus the number of collinear triples is 



E/n 2 n\ 



E (l) * < E e E s ^ E c * 2 (i + 7) < c E (t + in ) * < n2 

i=2 V ' i=2 j=i i=2 V ' i=2 ^ ' 

□ 



Note that Lefmann [15] proved Lemma 1 for the case of the y/n x y/n grid via a direct 
counting argument. 

To apply Lemma 1 it is useful to consider the 3-uniform hypergraph H(P) determined by 
a set of points P, with vertex set P, and an edge for each collinear triple in P. A subset 
of P is in general position if and only if it is an independent set in H(P). The size of 
the largest independent set in a hypergraph H is denoted a{H). Spencer [23] proved 
the following lower bound on a(H). 

Lemma 2 (Spencer [23]). Let H be an r-uniform hypergraph with n vertices and m edges. 
If m < n/r then a(H) > n/2. If m ^ n/r then 



V > " r r/(r-l) ( m /„)V(r-l) ■ 

Lemmas 1 and 2 imply our first result. 

Theorem 3. Let P be a set of n points with at most £ collinear, for some £ ^ 0(-*/n). 
Then P contains a set of ^(v/jtt) points in general position. 
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Proof. Let m be the number of edges In H(P). By Lemma 1, m/n ^ bnlni for some 
constant b. Now apply Lemma 2 with r = 3. If m < n/3 then a(H(P)) > n/2, as 
required. Otherwise, 



CfffP^ 2n 2ra _ 2 / n 

□ 



Theorem 3 answers, up to a logarithmic factor, a symmetric Ramsey style version of 
the general position subset selection problem posed by Gowers [13]. He asked for the 
minimum Integer GP(q) such that every set of at least GP(g) points In the plane contains 
q colllnear points or q points In general position. Gowers noted that 0(q 2 ) ^ GP(g) ^ 
0(q 3 ). Theorem 3 with £ = q- 1 and n = GP(g) Implies that n( v /GP(g)/ln(g - 1)) < q 
and so GP(q) < 0{q 2 \uq). 

The bound GP(g) ^ £l{q 2 ) comes from the q x q grid, which contains no q + 1 colllnear 
points, and no more than 2q + 1 In general position, since each row can have at most 2 
points. Determining the maximum number of points In general position In the q x q grid 
Is known as the no-three-in-Une problem. It was first posed by Dudeney in 1917 [4] - 
see [14] for the best known bound and for more on its history. 

As an aside, note that Pach and Sharlr [18] proved a result somewhat similar to Lemma 1 
for the number of triples In P determining a fixed angle a 6 (0, 7r). Their proof Is similar 
to that of Lemma 1 in its use of the Szemeredl-Trotter theorem. Also, Elekes [6] employed 
Lemma 2 to prove a similar result to Theorem 3 for the problem of finding large subsets 
with no triple determining a given angle a G (0, 7r). Pach and Sharir and Elekes did not 
allow the case a = 0, that is, colllnear triples. This may be because their work did not 
consider the parameter I, without which the case a = Is exceptional since P could be 
entirely colllnear, and all triples could determine the same angle. 

The following lemma of Sudakov [24, Proposition 2.3] Is a corollary of a result by Duke, 
Lefmann and Rodl [5]. 

Lemma 4 (Sudakov [24]). Let H be a 3-uniform hypergraph on n vertices with m edges. 
Let t ^ \Jmjn and suppose there exists an e > such that the number of edges 

containing any fixed pair of vertices of H is at most t 1 ~ e . Then a(H) ^ f2 e ^ f Vln t^j . 
Lemmas 1 and 4 can be used to prove our second result. 

Theorem 5. Fix constants e > and d > 0. Let P be a set of n points in the plane 
with at most I collinear points, where 3 ^ £ ^ (dn)^ 1 "^/ 2 . Then P contains a set of 
Sl{y/n log^ n) points in general position. 

Proof. Let m be the number of edges in H(P). By Lemma 1, for some constant c ^ 1, 

O 9 9 9 9 

m ^ c£ n + cn hi£ < cdn + cn hxl ^ (a + l)cn \n£. 
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Define t := y/(d+ l)cn\n£. Thus t ^ sjmjn. Each pair of vertices in H is in less than 
£ edges of H, and 

£ < (dn^ 1 -^ 2 < ((d + l)cnln€)( 1_e ^ a = i 1 " 6 . 

Thus the assumptions in Lemma 4 are satisfied. So H contains an independent set of 
size v / lni). Moreover, 




Cl(y/n log£ n). 



Thus P contains a subset of Q(<\/n log £ n) points in general position. □ 



3. Generalisations 

In this section we consider the function f(n,£,k) defined to be the maximum integer 
such that everg set of n points in the plane with at most £ collinear contains a subset of 
f(n,£,k) points with at most k collinear, where k < £. 

Brat? [1] considered this guestion for fixed £ = k + 1, and showed that 

o(n) > f(n,k + l,k) ^ ^(^'-^(lnn) 1 ^). 

This can be seen as a generalisation of the results of Fiiredi [10] for /(n, 3,2). As in 
Fiiredi's work, the lower bound comes from a result on partial Steiner sgstems [22], and 
the upper bound comes from the density Hales-Jewett theorem [12]. Lefmann [16] further 
generalised these results for fixed £ and k bg showing that 

f(n,£,k)>ntA n{k ~ 1)/k Q* n ) 1/k )- 

The density Hales-Jewett theorem also implies the general bound f(n,£,k) ^ o(n). 

The result of Lefmann may be generalised to include the dependence of f(n,£,k) on £ 
for fixed k ^ 3, analogouslg to Theorems 3 and 5 for k = 2. The first result we need is 
a generalisation of Lemma 1. It is proved in the same wag. 

Lemma 6. Let P be a set of n points in the ptane with at most £ collinear. Then, for 
k ^ 4, the number of collinear k-tuples in P is at most c(£ k ~ 3 n 2 + l k ~ 1 n) for some 
absolute constant c. □ 

Lemmas 2 and 6 imply the following theorem which is proved in the same wag as Theo- 
rem 3. 

Theorem 7. Ifk^S is fixed and £ sC 0(^n), then f(n, £, k) > fl k (fJ=Wsr) • n 
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For £ = Vn, Theorem 7 implies f(n,^,k) > n k = V k ( n (2fc-2-/=+2)/2 fe ) = 

Vtk{y/n). For k ^ 3 this answers completely a generalised version of Gowers' ques- 
tion [13], namely, to determine the minimum Integer GP k (q) such that every set of at 
least GPk(q) points In the plane contains q colllnear points or q points with at most k 
colllnear. Thus GPk{q) ^ 0(q 2 ). The bound GPjt(g) ^ Q(q 2 ) comes from the following 
construction. Let m := [(q — l)/k\ and let P be the m x m grid. Then P has at most m 
points colllnear, and m < q. If S Is a subset of P with at most k colllnear, then S has at 
most k points In each row. So \S\ ^ km ^ q — 1. 

Theorem 5 can be generalised using Lemma 6 and a theorem of Duke, Lefmann and 
Rodl [5] (the one that Implies Lemma 4). 

Theorem 8 (Duke, Lefmann and Rodl [5]). Let H be a k-uniform hypergraph with maximum 
degree A(H) ^ t k ~ l where t 3> k. Let Pj(H) be the number of pairs of edges of H 
sharing exactly j vertices. If Pj(H) ^ ni 2fe_ - 7_1_ ' y for j = 2, . . . , k — 1 and some 7 > 0, 
then a(H) ^ fc;7 (f (lnn) 1 /^" 1 )). 

Theorem 9. Fix constants d > and e G (0, 1). If k ^ 3 is fixed and 4 < £ < drS 1 '^! 2 
then 

f n {k-i)/k \ 

Proof. Given a set P of n points with at most £ colllnear, a subset with at most k colllnear 
points corresponds to an independent set In the (k + l)-unlform hypergraph H k +i{P) 
of colllnear (k + l)-tuples In P. By Lemma 6, the number of edges in H k +i{P) Is 
m ^ c(n 2 £ k ~ 2 + nl k ) for some constant c. The first term dominates since £ ^ o{y/n). For 
n large enough, m/n ^ 2cn£ k ~ 2 . 

To limit the maximum degree of Hk+\{P), discard vertices of degree greater than 2{k + 
l)m/n. Let h be the number of such vertices. Considering the sum of degrees, (k+l)m ^ 
h2(k + l)m/n, and so h ^ n/2. Thus discarding these vertices yields a new point set 
P' such that \P'\ ^ n/2 and A(i? fc+1 (P / )) < 4(fc + \)cnl k - 2 . Note that an Independent 
set In H k+ i(P') Is also Independent In H k+ \{P). 

Set t := (4(Jfe + ljcrrf* -2 ) 1 /*, so m ^ 2(fc+i) ntfe and A (^+i( p ')) < * fc - as required for 
Theorem 8. By assumption, £ ^ dn^~ e ^ 2 . Thus 

l-e 



4(fc + l)c 



k 



Hence £i^ +fc - 2 < ^4?, Implying £ < Ci(yfc)^ +fe - 2 = CiCfc)* 1 -*^ for some 
constant Ci(fc). Define e' := 1 - 737%' so e' > (since e < 1) and £ ^ Ci(&)i w . 
To bound pj{H k+ i{P')) for j = 2, . . . , fc, first choose one edge (which determines a line), 
then choose the subset to be shared, then choose points from the line to complete the 
second edge of the pair. Thus for 7 := e'/2 and sufficiently large n, 
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< C 2 (k)nt k £ k+1 ^ 

^C 2 {k) (Ci (k) )*+i-J n t* t (i-0(*+i-i) 

Hence the second requirement of Theorem 8 Is satisfied. Thus 

a{H k+1 (P'))>£l Ke (^{\nt) l l k ) 

( n (k-l)/k , . i/ fc \ 

> ( W H^ fc - 2 ) 1/fc )) j 

□ 

4. Conjectures 

Theorem 7 suggests the following conjecture, which would completely answer Gowers' 
question [13], showing that GP(g) = @(q 2 ). It is true for the \fn x y/n grid [7, 14]. 

Conjecture 10. f(n,y/n) ^ 0(y / n). 

A natural variation of the general position subset selection problem Is to colour the points 
of P with as few colours as possible, such that each colour class Is In general position. An 
easy application of the Lovasz Local Lemma shows that under this requirement, n points 
with at most £ colllnear are colourable with 0(y£n) colours. The following conjecture 
would Imply Conjecture 10. It is also true for the ^fn x y/n grid [26]. 

Conjecture 1 1 . Every set P of n points in the plane with at most \fn collinear can be 
coloured with 0(y/n) colours such that each colour class is in general position. 

The following proposition Is somewhat weaker than Conjecture 11. 

Proposition 12. Every set P of n points in the plane with at most y/n collinear can be 
coloured with 0{\/n\v?^ 2 n) colours such that each colour class is in general position. 

Proof. Colour P by Iteratlvely selecting a largest subset In general position and giving 
It a new colour. Let Pq := P. Let Ci be a largest subset of Pj In general position and 
let Pj+i := Pi \ Ci. Define := \Pi\. Applying Lemma 1 to Pi shows that H(Pi) has 
0(n 2 ln£ + £ 2 rii) edges. Thus the average degree of H(Pi) is at most 0(niln£ + £ 2 ) 
which Is O(nlnn) since rii ^ n and £ ^ yfn. 

Applying Lemma 2 gives \d\ = a(H(Pi)) > cu{ / vnTnn for some constant c > 0. Thus 
n, ^ n(l — cj\Jn In n)\ It Is well known (and not difficult to show) that If a sequence 
of numbers m; satisfies m 8 ^ m(l — \/x) % for some x > 1 and If j > xlnm, then 
rrij ^ 1. Hence if k ^ \fnhvn In n/c then n k ^ 1, so the number of colours used Is 

0(^/Eln 3/2 n). □ 
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The problem of determining the correct asymptotics of f(n,£) (and f(n,£,k)) for fixed £ 
remains wide open. The Szemeredi-Trotter theorem is essentially tight for the y/n x ^/n 
grid [19], but says nothing for point sets with bounded collinearities. For this reason, the 
lower bounds on f(n,£) for fixed £ remain essentially combinatorial. Finding a way to 
bring geometric information to bear in this situation is an interesting challenge. 

Conjecture 13. If £ is fixed, then f(n,£) ^ fi^(n/ polylog(n)). 

The point set that gives the upper bound f(n,£) ^ o(n) (from the density Hales-Jewett 
theorem) is the generic projection to the plane of the [log^ n\ -dimensional £ x £ x • • • x £ 
integer lattice (henceforth [£] d where d := [log^(n)J ). The problem of finding large general 
position subsets in this point set for £ = 3 is known as Moser's cube problem [17, 21], 
and the best known asymptotic lower bound is 0(n/\/lnn) [3, 21]. 

In the colouring setting, the following conjecture is equivalent to Conjecture 13 by an 
argument similar to that of Proposition 12. 

Conjecture 1 4. For all fixed £ ^ 3, every set of n points in the plane with at most £ 
collinear can be coloured with 0£(polylog(n)) colours such that each colour class is in 
general position. 

Conjecture 14 is true for [£} d , which can be coloured with 0(d e ~ 1 ) colours as follows. For 
each x S [£] d , define a signature vector in l} whose entries are the number of entries 
in x equal to 1, 2, . . . £. The number of such signatures is the number of partitions of d 
into at most £ parts, which is 0(d e ~ l ). Give each set of points with the same signature 
its own colour. To see that this is a proper colouring, suppose that {a,b,c} C [£] d is a 
monochromatic collinear triple, with b between a and c. Permute the coordinates so that 
the entries of b are non-decreasing. Consider the first coordinate i in which ai, hi and a 
are not all equal. Then without loss of generality, a, < b{. But this implies that a has 
more entries equal to at than b does, contradicting the assumption that the signatures 
are equal. 
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